Realization of Minimum Discursive Units Segmentation of Arab Oral Utterances

Authors

  • Chahira Lhioui
  • Anis Zouaghi
  • Mounir Zrigui
Abstract

Unlike segmentation of written text, discourse segmentation of Arabic oral dialogues is a challenging task, held back in most cases by the spontaneous character of speech. Like any segmentation task, segmentation into minimum discursive units (UDM) aims to split the utterances of a discourse into simple propositions that subsequent processing can readily use. Most prior work on Arabic has relied on extensive syntactic analysis. In this article, we aim to show that hybrid approaches combining linguistic and probabilistic processing are more effective than purely linguistic ones. We evaluated our segmentation on a relatively large corpus, which we built using the Wizard of Oz method.

Similar resources

Audio Speech Segmentation Without Language-Specific Knowledge

Speech segmentation is the problem of finding word boundaries in spoken language when the underlying vocabulary is still unknown. Here we show that a system with no phonemic knowledge can find word boundaries. The system first subdivides an utterance by recursively clustering similar parts of the signal together until the cepstral coefficient variance is low within each new segment. These segme...

The Challenges of the Elections Systems of Persian Gulf Arab Countries

This article intends to clarify views regarding important challenges that have originated from the political, social, cultural and geopolitical structures in the elections systems of Persian Gulf Arab countries. Challenges that determine the compatibility levels of elections systems of these countries with the world’s democratic systems. An efficient elections system is the prerequisite for the...

Automatic initial and final segmentation in cleft palate speech of Mandarin speakers

The speech unit segmentation is an important pre-processing step in the analysis of cleft palate speech. In Mandarin, one syllable is composed of two parts: initial and final. In cleft palate speech, the resonance disorders occur at the finals and the voiced initials, while the articulation disorders occur at the unvoiced initials. Thus, the initials and finals are the minimum speech units, whi...

Textuality: The ‘form’ to Be Focused on in SLA

Due to the special (procedural) nature of the language (verbal communication) ‘knowledge’, the dominant trends in applied linguistics research in the last few decades have been advocating ‘acquisition’ rather than ‘learning’ activities where the main focus in SL & FL education should be on ‘meaning’ while some ‘focus-on-form’ being justified. But the ‘form’ to be ‘focused-on’ is mostly misconce...

Automatic Labeling of Corpora for Speech

One of the bottlenecks in the development of text-to-speech synthesizers based on segment concatenation is the need for large, segmented and labeled corpora. Consequently, as manual segmentation and labeling is a tedious and time consuming task, there is a strong demand for automatic labeling systems which can label speech from many languages. Several systems have been proposed already, but the...

Journal:
  • Int. J. Comput. Linguistics Appl.

Volume: 7

Publication date: 2016